Improving Ranking Using Quantum Probability

نویسنده

  • Massimo Melucci
چکیده

Data management systems, like database, information extraction, information retrieval or learning systems, store, organize, index, retrieve and rank information units, such as tuples, objects, documents, items to match a pattern (e.g. classes and profiles) or meet a requirement (e.g., relevance, usefulness and utility). To this end, these systems rank information units by probability to decide whether an information unit matches a pattern or meets a requirement. Classical probability theory represents events as sets and probability as set measures. Thus, distributive and total probability laws are admitted. Quantum probability is a non-classical theory nor does admit distributive and total probability laws. Although ranking by probability is far from being perfect, it is optimal thanks to statistical decision theory and parameter tuning. The main question asked in the paper is whether further improvement over the optimality provided by probability may be obtained if the classical probability theory is replaced by quantum probability theory. Whereas classical probability (and detection theory) is based on sets such that the regions of acceptance / rejection are set-based detectors, quantum probability is based on subspace-based detectors. The paper shows that ranking information units by quantum probability differs from ranking them by classical probability provided the same data used for parameter estimation. As probability of detection (also known as recall or power) and probability of false alarm (also known as fallout or size) measure the quality of ranking, we point out and show that ranking by quantum probability yields higher probability of detection than ranking by classical probability provided a given probability of false alarm and the same parameter estimation data. As quantum probability provided more effective detectors than classical probability within other domains that data management, we conjencture that, the system that can implement subspace-based detectors shall be more effective than a system which implements a setbased detectors, the effectiveness being calculated as expected recall estimated over the probability of detection and expected fallout estimated over the probability of false alarm.

منابع مشابه

Developing the Quantum Probability Ranking Principle

In this work, we summarise the development of a ranking principle based on quantum probability theory, called the Quantum Probability Ranking Principle (QPRP), and we also provide an overview of the initial experiments performed employing the QPRP. The main difference between the QPRP and the classic Probability Ranking Principle, is that the QPRP implicitly captures the dependencies between do...

متن کامل

The Quantum Probability Ranking Principle for Information Retrieval

While the Probability Ranking Principle for Information Retrieval provides the basis for formal models, it makes a very strong assumption regarding the dependence between documents. However, it has been observed that in real situations this assumption does not always hold. In this paper we propose a reformulation of the Probability Ranking Principle based on quantum theory. Quantum probability ...

متن کامل

Using the Quantum Probability Ranking Principle to Rank Interdependent Documents

A known limitation of the Probability Ranking Principle (PRP) is that it does not cater for dependence between documents. Recently, the Quantum Probability Ranking Principle (QPRP) has been proposed, which implicitly captures dependencies between documents through“quantum interference”. This paper explores whether this new ranking principle leads to improved performance for subtopic retrieval, ...

متن کامل

A Novel Re-ranking Approach Inspired by Quantum Measurement

Quantum theory (QT) has recently been employed to advance the theory of information retrieval (IR). A typical method, namely the Quantum Probability Ranking Principle (QPRP), was proposed to re-rank top retrieved documents by considering the inter-dependencies between documents through the “quantum interference”. In this paper, we attempt to explore another important QT concept, namely the “qua...

متن کامل

Lazy Learning for Improving Ranking of Decision Trees

Decision tree-based probability estimation has received great attention because accurate probability estimation can possibly improve classification accuracy and probability-based ranking. In this paper, we aim to improve probability-based ranking under decision tree paradigms using AUC as the evaluation metric. We deploy a lazy probability estimator at each leaf to avoid uniform probability ass...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:
  • CoRR

دوره abs/1108.5491  شماره 

صفحات  -

تاریخ انتشار 2011